# Stable Training
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, designed to solve control tasks in the LunarLander-v2 environment.
Physics Model
P
sigalaz
20
0
Td3 MountainCarContinuous V0
A TD3 reinforcement learning agent trained based on the stable-baselines3 library, specifically designed for the MountainCarContinuous-v0 environment.
Physics Model
T
sb3
203
0
Td3 Hopper V3
This is a TD3 agent model trained using the stable-baselines3 library, specifically designed for reinforcement learning tasks in the Hopper-v3 environment.
Physics Model
T
sb3
30
0
Td3 HalfCheetah V3
This is a TD3 reinforcement learning agent trained using the stable-baselines3 library, specifically designed for the HalfCheetah-v3 environment, achieving an average reward of 9709.01.
Physics Model
T
sb3
23
0
Featured Recommended AI Models